Overview
Brought to you by YData
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 1232950 |
| Missing cells | 15863 |
| Missing cells (%) | 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 804.3 MiB |
| Average record size in memory | 684.0 B |
Variable types
| Numeric | 9 |
|---|---|
| DateTime | 1 |
| Text | 4 |
| Categorical | 5 |
ARREST_BORO is highly overall correlated with ARREST_PRECINCT and 1 other fields | High correlation |
ARREST_PRECINCT is highly overall correlated with ARREST_BORO | High correlation |
KY_CD is highly overall correlated with LAW_CAT_CD | High correlation |
LAW_CAT_CD is highly overall correlated with KY_CD | High correlation |
Latitude is highly overall correlated with Y_COORD_CD | High correlation |
Longitude is highly overall correlated with X_COORD_CD | High correlation |
X_COORD_CD is highly overall correlated with ARREST_BORO and 1 other fields | High correlation |
Y_COORD_CD is highly overall correlated with Latitude | High correlation |
LAW_CAT_CD is highly imbalanced (58.7%) | Imbalance |
PERP_SEX is highly imbalanced (55.9%) | Imbalance |
Latitude is highly skewed (γ1 = -65.43980723) | Skewed |
Longitude is highly skewed (γ1 = 310.4297308) | Skewed |
ARREST_KEY has unique values | Unique |
JURISDICTION_CODE has 1096835 (89.0%) zeros | Zeros |
Reproduction
| Analysis started | 2025-10-15 20:29:05.396581 |
|---|---|
| Analysis finished | 2025-10-15 20:29:30.089087 |
| Duration | 24.69 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
ARREST_KEY
Real number (ℝ)
Unique
| Distinct | 1232950 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2082439 × 108 |
| Minimum | 9944197 |
|---|---|
| Maximum | 2.7977973 × 108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.4 MiB |
Quantile statistics
| Minimum | 9944197 |
|---|---|
| 5-th percentile | 1.7329321 × 108 |
| Q1 | 1.9187172 × 108 |
| median | 2.1769636 × 108 |
| Q3 | 2.4878704 × 108 |
| 95-th percentile | 2.7481404 × 108 |
| Maximum | 2.7977973 × 108 |
| Range | 2.6983554 × 108 |
| Interquartile range (IQR) | 56915326 |
Descriptive statistics
| Standard deviation | 33238290 |
|---|---|
| Coefficient of variation (CV) | 0.15051911 |
| Kurtosis | -1.1195912 |
| Mean | 2.2082439 × 108 |
| Median Absolute Deviation (MAD) | 28005762 |
| Skewness | 0.17435533 |
| Sum | 2.7226543 × 1014 |
| Variance | 1.1047839 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 279197226 | 1 | < 0.1% |
| 197562265 | 1 | < 0.1% |
| 198858308 | 1 | < 0.1% |
| 198236576 | 1 | < 0.1% |
| 197957452 | 1 | < 0.1% |
| 198283837 | 1 | < 0.1% |
| 197524780 | 1 | < 0.1% |
| 198340574 | 1 | < 0.1% |
| 198409670 | 1 | < 0.1% |
| 198853690 | 1 | < 0.1% |
| Other values (1232940) | 1232940 |
| Value | Count | Frequency (%) |
| 9944197 | 1 | |
| 9956777 | 1 | |
| 9958671 | 1 | |
| 10110560 | 1 | |
| 10111247 | 1 | |
| 10125673 | 1 | |
| 10201950 | 1 | |
| 10350608 | 1 | |
| 10412552 | 1 | |
| 10554163 | 1 |
| Value | Count | Frequency (%) |
| 279779734 | 1 | |
| 279767587 | 1 | |
| 279767582 | 1 | |
| 279767580 | 1 | |
| 279767578 | 1 | |
| 279767574 | 1 | |
| 279767307 | 1 | |
| 279767302 | 1 | |
| 279766988 | 1 | |
| 279766987 | 1 |
ARREST_DATE
Date
| Distinct | 2787 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.4 MiB |
| Minimum | 2006-01-03 00:00:00 |
|---|---|
| Maximum | 2023-12-31 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
PD_CD
Real number (ℝ)
| Distinct | 321 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 731 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 433.19162 |
| Minimum | 0 |
|---|---|
| Maximum | 997 |
| Zeros | 11 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 101 |
| Q1 | 117 |
| median | 397 |
| Q3 | 705 |
| 95-th percentile | 922 |
| Maximum | 997 |
| Range | 997 |
| Interquartile range (IQR) | 588 |
Descriptive statistics
| Standard deviation | 276.98689 |
|---|---|
| Coefficient of variation (CV) | 0.63940961 |
| Kurtosis | -1.1690892 |
| Mean | 433.19162 |
| Median Absolute Deviation (MAD) | 284 |
| Skewness | 0.33734418 |
| Sum | 5.3378695 × 108 |
| Variance | 76721.735 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 101 | 140949 | 11.4% |
| 339 | 118910 | 9.6% |
| 109 | 76754 | 6.2% |
| 922 | 64629 | 5.2% |
| 397 | 56869 | 4.6% |
| 779 | 48394 | 3.9% |
| 439 | 47882 | 3.9% |
| 511 | 40936 | 3.3% |
| 244 | 29909 | 2.4% |
| 113 | 28726 | 2.3% |
| Other values (311) | 578261 |
| Value | Count | Frequency (%) |
| 0 | 11 | < 0.1% |
| 1 | 3 | < 0.1% |
| 2 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| 12 | 9 | < 0.1% |
| 15 | 278 | < 0.1% |
| 16 | 1370 | |
| 29 | 35 | < 0.1% |
| 30 | 4 | < 0.1% |
| 35 | 42 | < 0.1% |
| Value | Count | Frequency (%) |
| 997 | 20 | < 0.1% |
| 973 | 11 | < 0.1% |
| 972 | 16 | < 0.1% |
| 970 | 2 | < 0.1% |
| 969 | 13343 | |
| 968 | 258 | < 0.1% |
| 965 | 35 | < 0.1% |
| 963 | 14 | < 0.1% |
| 961 | 48 | < 0.1% |
| 957 | 4 | < 0.1% |
PD_DESC
Text
| Distinct | 411 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1663 |
| Missing (%) | 0.1% |
| Memory size | 87.5 MiB |
Length
| Max length | 54 |
|---|---|
| Median length | 44 |
| Mean length | 25.514791 |
| Min length | 6 |
Unique
| Unique | 17 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | STRANGULATION 1ST |
|---|---|
| 2nd row | STRANGULATION 1ST |
| 3rd row | RAPE 3 |
| 4th row | RAPE 1 |
| 5th row | (null) |
| Value | Count | Frequency (%) |
| assault | 226041 | 6.6% |
| 3 | 208530 | 6.1% |
| from | 173188 | 5.1% |
| open | 166792 | 4.9% |
| areas | 138986 | 4.1% |
| larceny,petit | 119196 | 3.5% |
| controlled | 86469 | 2.5% |
| possession | 82398 | 2.4% |
| traffic,unclassified | 77972 | 2.3% |
| 2 | 77341 | 2.3% |
| Other values (525) | 2054256 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2994694 | 9.5% |
| A | 2783998 | 8.9% |
| E | 2670856 | 8.5% |
| I | 2335651 | 7.4% |
| N | 2270492 | 7.2% |
| 2238889 | 7.1% | |
| R | 1721424 | 5.5% |
| L | 1588810 | 5.1% |
| T | 1557689 | 5.0% |
| C | 1479748 | 4.7% |
| Other values (35) | 9773780 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 31416031 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| S | 2994694 | 9.5% |
| A | 2783998 | 8.9% |
| E | 2670856 | 8.5% |
| I | 2335651 | 7.4% |
| N | 2270492 | 7.2% |
| 2238889 | 7.1% | |
| R | 1721424 | 5.5% |
| L | 1588810 | 5.1% |
| T | 1557689 | 5.0% |
| C | 1479748 | 4.7% |
| Other values (35) | 9773780 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 31416031 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| S | 2994694 | 9.5% |
| A | 2783998 | 8.9% |
| E | 2670856 | 8.5% |
| I | 2335651 | 7.4% |
| N | 2270492 | 7.2% |
| 2238889 | 7.1% | |
| R | 1721424 | 5.5% |
| L | 1588810 | 5.1% |
| T | 1557689 | 5.0% |
| C | 1479748 | 4.7% |
| Other values (35) | 9773780 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 31416031 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| S | 2994694 | 9.5% |
| A | 2783998 | 8.9% |
| E | 2670856 | 8.5% |
| I | 2335651 | 7.4% |
| N | 2270492 | 7.2% |
| 2238889 | 7.1% | |
| R | 1721424 | 5.5% |
| L | 1588810 | 5.1% |
| T | 1557689 | 5.0% |
| C | 1479748 | 4.7% |
| Other values (35) | 9773780 |
KY_CD
Real number (ℝ)
High correlation
| Distinct | 72 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2250 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 254.62639 |
| Minimum | 101 |
|---|---|
| Maximum | 995 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.4 MiB |
Quantile statistics
| Minimum | 101 |
|---|---|
| 5-th percentile | 105 |
| Q1 | 116 |
| median | 238 |
| Q3 | 344 |
| 95-th percentile | 361 |
| Maximum | 995 |
| Range | 894 |
| Interquartile range (IQR) | 228 |
Descriptive statistics
| Standard deviation | 150.45477 |
|---|---|
| Coefficient of variation (CV) | 0.59088443 |
| Kurtosis | 5.9284099 |
| Mean | 254.62639 |
| Median Absolute Deviation (MAD) | 113 |
| Skewness | 1.702305 |
| Sum | 3.133687 × 108 |
| Variance | 22636.639 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 344 | 188559 | |
| 341 | 119196 | 9.7% |
| 106 | 102426 | 8.3% |
| 126 | 72190 | 5.9% |
| 348 | 67559 | 5.5% |
| 235 | 62665 | 5.1% |
| 105 | 57125 | 4.6% |
| 109 | 55586 | 4.5% |
| 117 | 48467 | 3.9% |
| 359 | 39127 | 3.2% |
| Other values (62) | 417800 |
| Value | Count | Frequency (%) |
| 101 | 8293 | 0.7% |
| 102 | 42 | < 0.1% |
| 103 | 162 | < 0.1% |
| 104 | 4649 | 0.4% |
| 105 | 57125 | |
| 106 | 102426 | |
| 107 | 33441 | 2.7% |
| 109 | 55586 | |
| 110 | 8174 | 0.7% |
| 111 | 8629 | 0.7% |
| Value | Count | Frequency (%) |
| 995 | 9454 | |
| 882 | 20 | < 0.1% |
| 881 | 14193 | |
| 880 | 392 | < 0.1% |
| 685 | 28 | < 0.1% |
| 678 | 1730 | 0.1% |
| 677 | 4655 | 0.4% |
| 676 | 28 | < 0.1% |
| 675 | 2095 | 0.2% |
| 672 | 4 | < 0.1% |
OFNS_DESC
Text
| Distinct | 86 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1663 |
| Missing (%) | 0.1% |
| Memory size | 81.2 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 36 |
| Mean length | 20.081346 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | FELONY ASSAULT |
|---|---|
| 2nd row | FELONY ASSAULT |
| 3rd row | RAPE |
| 4th row | RAPE |
| 5th row | (null) |
| Value | Count | Frequency (%) |
| 308756 | 8.2% | |
| offenses | 296971 | 7.8% |
| assault | 290985 | 7.7% |
| related | 281366 | 7.4% |
| 3 | 189545 | 5.0% |
| larceny | 182956 | 4.8% |
| dangerous | 162467 | 4.3% |
| petit | 119196 | 3.1% |
| drugs | 111132 | 2.9% |
| felony | 106957 | 2.8% |
| Other values (136) | 1736681 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 2668647 | 10.8% |
| 2555725 | 10.3% | |
| A | 2243407 | 9.1% |
| S | 2174964 | 8.8% |
| L | 1582035 | 6.4% |
| N | 1535922 | 6.2% |
| R | 1525781 | 6.2% |
| T | 1342675 | 5.4% |
| O | 1204948 | 4.9% |
| F | 1157864 | 4.7% |
| Other values (31) | 6733932 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 24725900 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| E | 2668647 | 10.8% |
| 2555725 | 10.3% | |
| A | 2243407 | 9.1% |
| S | 2174964 | 8.8% |
| L | 1582035 | 6.4% |
| N | 1535922 | 6.2% |
| R | 1525781 | 6.2% |
| T | 1342675 | 5.4% |
| O | 1204948 | 4.9% |
| F | 1157864 | 4.7% |
| Other values (31) | 6733932 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 24725900 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| E | 2668647 | 10.8% |
| 2555725 | 10.3% | |
| A | 2243407 | 9.1% |
| S | 2174964 | 8.8% |
| L | 1582035 | 6.4% |
| N | 1535922 | 6.2% |
| R | 1525781 | 6.2% |
| T | 1342675 | 5.4% |
| O | 1204948 | 4.9% |
| F | 1157864 | 4.7% |
| Other values (31) | 6733932 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 24725900 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| E | 2668647 | 10.8% |
| 2555725 | 10.3% | |
| A | 2243407 | 9.1% |
| S | 2174964 | 8.8% |
| L | 1582035 | 6.4% |
| N | 1535922 | 6.2% |
| R | 1525781 | 6.2% |
| T | 1342675 | 5.4% |
| O | 1204948 | 4.9% |
| F | 1157864 | 4.7% |
| Other values (31) | 6733932 |
LAW_CODE
Text
| Distinct | 1797 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 51 |
| Missing (%) | < 0.1% |
| Memory size | 69.4 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.9997753 |
| Min length | 6 |
Unique
| Unique | 293 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PL 1211200 |
|---|---|
| 2nd row | PL 1211300 |
| 3rd row | PL 1302503 |
| 4th row | PL 1303501 |
| 5th row | PL 2407800 |
| Value | Count | Frequency (%) |
| pl | 1096395 | |
| 1200001 | 124038 | 5.3% |
| 1552500 | 118910 | 5.1% |
| 2200300 | 40936 | 1.8% |
| vtl0511001 | 39340 | 1.7% |
| 215510b | 39316 | 1.7% |
| 1200502 | 31190 | 1.3% |
| 1200501 | 29838 | 1.3% |
| 1553001 | 28145 | 1.2% |
| 1201401 | 23159 | 1.0% |
| Other values (1793) | 761748 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3223464 | |
| 1 | 1933544 | |
| L | 1214933 | 9.9% |
| 5 | 1206664 | 9.8% |
| 2 | 1160435 | 9.4% |
| 1100116 | 8.9% | |
| P | 1099558 | 8.9% |
| 3 | 279687 | 2.3% |
| 6 | 235665 | 1.9% |
| 4 | 232931 | 1.9% |
| Other values (31) | 641716 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12328713 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3223464 | |
| 1 | 1933544 | |
| L | 1214933 | 9.9% |
| 5 | 1206664 | 9.8% |
| 2 | 1160435 | 9.4% |
| 1100116 | 8.9% | |
| P | 1099558 | 8.9% |
| 3 | 279687 | 2.3% |
| 6 | 235665 | 1.9% |
| 4 | 232931 | 1.9% |
| Other values (31) | 641716 | 5.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12328713 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3223464 | |
| 1 | 1933544 | |
| L | 1214933 | 9.9% |
| 5 | 1206664 | 9.8% |
| 2 | 1160435 | 9.4% |
| 1100116 | 8.9% | |
| P | 1099558 | 8.9% |
| 3 | 279687 | 2.3% |
| 6 | 235665 | 1.9% |
| 4 | 232931 | 1.9% |
| Other values (31) | 641716 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12328713 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3223464 | |
| 1 | 1933544 | |
| L | 1214933 | 9.9% |
| 5 | 1206664 | 9.8% |
| 2 | 1160435 | 9.4% |
| 1100116 | 8.9% | |
| P | 1099558 | 8.9% |
| 3 | 279687 | 2.3% |
| 6 | 235665 | 1.9% |
| 4 | 232931 | 1.9% |
| Other values (31) | 641716 | 5.2% |
LAW_CAT_CD
Categorical
High correlation Imbalance
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9505 |
| Missing (%) | 0.8% |
| Memory size | 58.8 MiB |
| M | |
|---|---|
| F | |
| V | 9657 |
| I | 2247 |
| 9 | 1067 |
Length
| Max length | 6 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000082 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | F |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 701764 | |
| F | 508708 | |
| V | 9657 | 0.8% |
| I | 2247 | 0.2% |
| 9 | 1067 | 0.1% |
| (null) | 2 | < 0.1% |
| (Missing) | 9505 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 701764 | |
| f | 508708 | |
| v | 9657 | 0.8% |
| i | 2247 | 0.2% |
| 9 | 1067 | 0.1% |
| null | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 701764 | |
| F | 508708 | |
| V | 9657 | 0.8% |
| I | 2247 | 0.2% |
| 9 | 1067 | 0.1% |
| l | 4 | < 0.1% |
| ( | 2 | < 0.1% |
| n | 2 | < 0.1% |
| u | 2 | < 0.1% |
| ) | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1223455 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 701764 | |
| F | 508708 | |
| V | 9657 | 0.8% |
| I | 2247 | 0.2% |
| 9 | 1067 | 0.1% |
| l | 4 | < 0.1% |
| ( | 2 | < 0.1% |
| n | 2 | < 0.1% |
| u | 2 | < 0.1% |
| ) | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1223455 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 701764 | |
| F | 508708 | |
| V | 9657 | 0.8% |
| I | 2247 | 0.2% |
| 9 | 1067 | 0.1% |
| l | 4 | < 0.1% |
| ( | 2 | < 0.1% |
| n | 2 | < 0.1% |
| u | 2 | < 0.1% |
| ) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1223455 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 701764 | |
| F | 508708 | |
| V | 9657 | 0.8% |
| I | 2247 | 0.2% |
| 9 | 1067 | 0.1% |
| l | 4 | < 0.1% |
| ( | 2 | < 0.1% |
| n | 2 | < 0.1% |
| u | 2 | < 0.1% |
| ) | 2 | < 0.1% |
ARREST_BORO
Categorical
High correlation
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.8 MiB |
| K | |
|---|---|
| M | |
| B | |
| Q | |
| S |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | K |
| 3rd row | K |
| 4th row | B |
| 5th row | Q |
Common Values
| Value | Count | Frequency (%) |
| K | 336140 | |
| M | 305202 | |
| B | 281217 | |
| Q | 256888 | |
| S | 53503 | 4.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| k | 336140 | |
| m | 305202 | |
| b | 281217 | |
| q | 256888 | |
| s | 53503 | 4.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| K | 336140 | |
| M | 305202 | |
| B | 281217 | |
| Q | 256888 | |
| S | 53503 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1232950 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| K | 336140 | |
| M | 305202 | |
| B | 281217 | |
| Q | 256888 | |
| S | 53503 | 4.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1232950 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| K | 336140 | |
| M | 305202 | |
| B | 281217 | |
| Q | 256888 | |
| S | 53503 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1232950 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| K | 336140 | |
| M | 305202 | |
| B | 281217 | |
| Q | 256888 | |
| S | 53503 | 4.3% |
ARREST_PRECINCT
Real number (ℝ)
High correlation
| Distinct | 77 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 62.697747 |
| Minimum | 1 |
|---|---|
| Maximum | 123 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 40 |
| median | 62 |
| Q3 | 100 |
| 95-th percentile | 115 |
| Maximum | 123 |
| Range | 122 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 34.915222 |
|---|---|
| Coefficient of variation (CV) | 0.55688161 |
| Kurtosis | -1.1598091 |
| Mean | 62.697747 |
| Median Absolute Deviation (MAD) | 28 |
| Skewness | 0.083351921 |
| Sum | 77303187 |
| Variance | 1219.0727 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14 | 37920 | 3.1% |
| 40 | 36913 | 3.0% |
| 44 | 36182 | 2.9% |
| 75 | 35295 | 2.9% |
| 52 | 29250 | 2.4% |
| 113 | 28587 | 2.3% |
| 46 | 28307 | 2.3% |
| 43 | 26376 | 2.1% |
| 103 | 25788 | 2.1% |
| 47 | 25692 | 2.1% |
| Other values (67) | 922640 |
| Value | Count | Frequency (%) |
| 1 | 13888 | 1.1% |
| 5 | 20408 | |
| 6 | 12940 | 1.0% |
| 7 | 11857 | 1.0% |
| 9 | 11923 | 1.0% |
| 10 | 10503 | 0.9% |
| 13 | 15842 | |
| 14 | 37920 | |
| 17 | 6955 | 0.6% |
| 18 | 19038 |
| Value | Count | Frequency (%) |
| 123 | 5964 | 0.5% |
| 122 | 9818 | 0.8% |
| 121 | 14538 | |
| 120 | 23183 | |
| 115 | 20938 | |
| 114 | 22483 | |
| 113 | 28587 | |
| 112 | 9188 | 0.7% |
| 111 | 5191 | 0.4% |
| 110 | 20030 |
JURISDICTION_CODE
Real number (ℝ)
Zeros
| Distinct | 29 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4749901 |
| Minimum | 0 |
|---|---|
| Maximum | 97 |
| Zeros | 1096835 |
| Zeros (%) | 89.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 97 |
| Range | 97 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 10.498557 |
|---|---|
| Coefficient of variation (CV) | 7.1177134 |
| Kurtosis | 68.334887 |
| Mean | 1.4749901 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.2861721 |
| Sum | 1818589 |
| Variance | 110.21969 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1096835 | |
| 1 | 49786 | 4.0% |
| 2 | 46242 | 3.8% |
| 3 | 12518 | 1.0% |
| 97 | 10478 | 0.8% |
| 72 | 3367 | 0.3% |
| 11 | 2119 | 0.2% |
| 15 | 1869 | 0.2% |
| 4 | 1800 | 0.1% |
| 73 | 1758 | 0.1% |
| Other values (19) | 6178 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 1096835 | |
| 1 | 49786 | 4.0% |
| 2 | 46242 | 3.8% |
| 3 | 12518 | 1.0% |
| 4 | 1800 | 0.1% |
| 6 | 1433 | 0.1% |
| 7 | 866 | 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 124 | < 0.1% |
| 11 | 2119 | 0.2% |
| Value | Count | Frequency (%) |
| 97 | 10478 | |
| 88 | 128 | < 0.1% |
| 87 | 472 | < 0.1% |
| 85 | 136 | < 0.1% |
| 79 | 18 | < 0.1% |
| 76 | 4 | < 0.1% |
| 74 | 13 | < 0.1% |
| 73 | 1758 | 0.1% |
| 72 | 3367 | 0.3% |
| 71 | 959 | 0.1% |
AGE_GROUP
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 63.4 MiB |
| 25-44 | |
|---|---|
| 45-64 | |
| 18-24 | |
| <18 | 51512 |
| 65+ | 17393 |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.8882274 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 25-44 |
|---|---|
| 2nd row | 25-44 |
| 3rd row | 25-44 |
| 4th row | 45-64 |
| 5th row | <18 |
Common Values
| Value | Count | Frequency (%) |
| 25-44 | 679238 | |
| 45-64 | 244211 | 19.8% |
| 18-24 | 240596 | 19.5% |
| <18 | 51512 | 4.2% |
| 65+ | 17393 | 1.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 25-44 | 679238 | |
| 45-64 | 244211 | 19.8% |
| 18-24 | 240596 | 19.5% |
| 18 | 51512 | 4.2% |
| 65 | 17393 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 2087494 | |
| - | 1164045 | |
| 5 | 940842 | |
| 2 | 919834 | |
| 1 | 292108 | 4.8% |
| 8 | 292108 | 4.8% |
| 6 | 261604 | 4.3% |
| < | 51512 | 0.9% |
| + | 17393 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6026940 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 4 | 2087494 | |
| - | 1164045 | |
| 5 | 940842 | |
| 2 | 919834 | |
| 1 | 292108 | 4.8% |
| 8 | 292108 | 4.8% |
| 6 | 261604 | 4.3% |
| < | 51512 | 0.9% |
| + | 17393 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6026940 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 4 | 2087494 | |
| - | 1164045 | |
| 5 | 940842 | |
| 2 | 919834 | |
| 1 | 292108 | 4.8% |
| 8 | 292108 | 4.8% |
| 6 | 261604 | 4.3% |
| < | 51512 | 0.9% |
| + | 17393 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6026940 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 4 | 2087494 | |
| - | 1164045 | |
| 5 | 940842 | |
| 2 | 919834 | |
| 1 | 292108 | 4.8% |
| 8 | 292108 | 4.8% |
| 6 | 261604 | 4.3% |
| < | 51512 | 0.9% |
| + | 17393 | 0.3% |
PERP_SEX
Categorical
Imbalance
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.8 MiB |
| M | |
|---|---|
| F | |
| U | 3504 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 1012257 | |
| F | 217189 | 17.6% |
| U | 3504 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 1012257 | |
| f | 217189 | 17.6% |
| u | 3504 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 1012257 | |
| F | 217189 | 17.6% |
| U | 3504 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1232950 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 1012257 | |
| F | 217189 | 17.6% |
| U | 3504 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1232950 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 1012257 | |
| F | 217189 | 17.6% |
| U | 3504 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1232950 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 1012257 | |
| F | 217189 | 17.6% |
| U | 3504 | 0.3% |
PERP_RACE
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.4 MiB |
| BLACK | |
|---|---|
| WHITE HISPANIC | |
| WHITE | |
| BLACK HISPANIC | |
| ASIAN / PACIFIC ISLANDER | |
| Other values (2) | 11790 |
Length
| Max length | 30 |
|---|---|
| Median length | 5 |
| Mean length | 9.1854666 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | WHITE |
|---|---|
| 2nd row | BLACK |
| 3rd row | BLACK |
| 4th row | BLACK |
| 5th row | WHITE HISPANIC |
Common Values
| Value | Count | Frequency (%) |
| BLACK | 597247 | |
| WHITE HISPANIC | 309593 | |
| WHITE | 136439 | 11.1% |
| BLACK HISPANIC | 110957 | 9.0% |
| ASIAN / PACIFIC ISLANDER | 66924 | 5.4% |
| UNKNOWN | 8295 | 0.7% |
| AMERICAN INDIAN/ALASKAN NATIVE | 3495 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| black | 708204 | |
| white | 446032 | |
| hispanic | 420550 | |
| asian | 66924 | 3.6% |
| 66924 | 3.6% | |
| pacific | 66924 | 3.6% |
| islander | 66924 | 3.6% |
| unknown | 8295 | 0.4% |
| american | 3495 | 0.2% |
| indian/alaskan | 3495 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 1568808 | |
| A | 1420915 | |
| C | 1266097 | |
| H | 866582 | 7.7% |
| L | 778623 | 6.9% |
| K | 719994 | 6.4% |
| B | 708204 | 6.3% |
| 628312 | 5.5% | |
| N | 596758 | 5.3% |
| S | 557893 | 4.9% |
| Other values (12) | 2213035 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 11325221 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| I | 1568808 | |
| A | 1420915 | |
| C | 1266097 | |
| H | 866582 | 7.7% |
| L | 778623 | 6.9% |
| K | 719994 | 6.4% |
| B | 708204 | 6.3% |
| 628312 | 5.5% | |
| N | 596758 | 5.3% |
| S | 557893 | 4.9% |
| Other values (12) | 2213035 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 11325221 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| I | 1568808 | |
| A | 1420915 | |
| C | 1266097 | |
| H | 866582 | 7.7% |
| L | 778623 | 6.9% |
| K | 719994 | 6.4% |
| B | 708204 | 6.3% |
| 628312 | 5.5% | |
| N | 596758 | 5.3% |
| S | 557893 | 4.9% |
| Other values (12) | 2213035 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 11325221 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| I | 1568808 | |
| A | 1420915 | |
| C | 1266097 | |
| H | 866582 | 7.7% |
| L | 778623 | 6.9% |
| K | 719994 | 6.4% |
| B | 708204 | 6.3% |
| 628312 | 5.5% | |
| N | 596758 | 5.3% |
| S | 557893 | 4.9% |
| Other values (12) | 2213035 |
X_COORD_CD
Real number (ℝ)
High correlation
| Distinct | 58803 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1005358.6 |
| Minimum | 0 |
|---|---|
| Maximum | 1067302 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 978360.9 |
| Q1 | 991303 |
| median | 1004992 |
| Q3 | 1017314 |
| 95-th percentile | 1043685 |
| Maximum | 1067302 |
| Range | 1067302 |
| Interquartile range (IQR) | 26011 |
Descriptive statistics
| Standard deviation | 21375.253 |
|---|---|
| Coefficient of variation (CV) | 0.021261322 |
| Kurtosis | 5.3528094 |
| Mean | 1005358.6 |
| Median Absolute Deviation (MAD) | 12959 |
| Skewness | -0.32542428 |
| Sum | 1.2395569 × 1012 |
| Variance | 4.5690143 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1017119 | 8738 | 0.7% |
| 1026486 | 7127 | 0.6% |
| 1046315 | 6589 | 0.5% |
| 1006537 | 6273 | 0.5% |
| 987220 | 5798 | 0.5% |
| 1007694 | 5613 | 0.5% |
| 997897 | 5325 | 0.4% |
| 1020183 | 5232 | 0.4% |
| 1041879 | 5226 | 0.4% |
| 1032084 | 5170 | 0.4% |
| Other values (58793) | 1171859 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 913512 | 3 | |
| 913554 | 2 | |
| 913818 | 3 | |
| 913844 | 1 | < 0.1% |
| 913942 | 1 | < 0.1% |
| 914031 | 1 | < 0.1% |
| 914103 | 4 | |
| 914151 | 1 | < 0.1% |
| 914210 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1067302 | 1 | < 0.1% |
| 1067220 | 1 | < 0.1% |
| 1067185 | 6 | |
| 1067117 | 2 | < 0.1% |
| 1067113 | 1 | < 0.1% |
| 1067018 | 1 | < 0.1% |
| 1066940 | 1 | < 0.1% |
| 1066928 | 1 | < 0.1% |
| 1066898 | 1 | < 0.1% |
| 1066856 | 4 |
Y_COORD_CD
Real number (ℝ)
High correlation
| Distinct | 63551 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 208285.97 |
| Minimum | 0 |
|---|---|
| Maximum | 6253476 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 158770 |
| Q1 | 185878 |
| median | 206925 |
| Q3 | 236095 |
| 95-th percentile | 254446 |
| Maximum | 6253476 |
| Range | 6253476 |
| Interquartile range (IQR) | 50217 |
Descriptive statistics
| Standard deviation | 30586.234 |
|---|---|
| Coefficient of variation (CV) | 0.14684731 |
| Kurtosis | 1482.824 |
| Mean | 208285.97 |
| Median Absolute Deviation (MAD) | 24137 |
| Skewness | 8.0701838 |
| Sum | 2.5680619 × 1011 |
| Variance | 9.3551774 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 183909 | 8727 | 0.7% |
| 234533 | 7710 | 0.6% |
| 262591 | 7116 | 0.6% |
| 187088 | 6593 | 0.5% |
| 244511 | 6266 | 0.5% |
| 212676 | 5791 | 0.5% |
| 216954 | 5170 | 0.4% |
| 218129 | 4905 | 0.4% |
| 183789 | 4892 | 0.4% |
| 207813 | 4656 | 0.4% |
| Other values (63541) | 1171124 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 121131 | 1 | < 0.1% |
| 121152 | 1 | < 0.1% |
| 121312 | 2 | |
| 121390 | 2 | |
| 121474 | 3 | |
| 121508 | 3 | |
| 121540 | 1 | < 0.1% |
| 121545 | 1 | < 0.1% |
| 121681 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6253476 | 1 | < 0.1% |
| 4245690 | 1 | < 0.1% |
| 271909 | 2 | |
| 271820 | 2 | |
| 271819 | 3 | |
| 271730 | 4 | |
| 271698 | 3 | |
| 271547 | 1 | < 0.1% |
| 271349 | 2 | |
| 271323 | 1 | < 0.1% |
Latitude
Real number (ℝ)
High correlation Skewed
| Distinct | 134233 |
|---|---|
| Distinct (%) | 10.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.738295 |
| Minimum | 0 |
|---|---|
| Maximum | 57.070187 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 40.602464 |
| Q1 | 40.676808 |
| median | 40.734637 |
| Q3 | 40.814679 |
| 95-th percentile | 40.865034 |
| Maximum | 57.070187 |
| Range | 57.070187 |
| Interquartile range (IQR) | 0.13787089 |
Descriptive statistics
| Standard deviation | 0.091575853 |
|---|---|
| Coefficient of variation (CV) | 0.0022479059 |
| Kurtosis | 32752.534 |
| Mean | 40.738295 |
| Median Absolute Deviation (MAD) | 0.066243893 |
| Skewness | -65.439807 |
| Sum | 50228281 |
| Variance | 0.0083861368 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.67141166 | 5722 | 0.5% |
| 40.81039849 | 4908 | 0.4% |
| 40.67998074 | 4617 | 0.4% |
| 40.88733282 | 4573 | 0.4% |
| 40.68004873 | 4207 | 0.3% |
| 40.64502275 | 4114 | 0.3% |
| 40.72629309 | 3825 | 0.3% |
| 40.84413995 | 3576 | 0.3% |
| 40.64886713 | 3421 | 0.3% |
| 40.70744736 | 3277 | 0.3% |
| Other values (134223) | 1190710 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 40.49890536 | 1 | |
| 40.49895701 | 1 | |
| 40.499393 | 1 | |
| 40.49940083 | 1 | |
| 40.499616 | 2 | |
| 40.49985024 | 1 | |
| 40.49985024 | 2 | |
| 40.49994 | 2 | |
| 40.49994754 | 1 |
| Value | Count | Frequency (%) |
| 57.07018725 | 1 | < 0.1% |
| 51.7403346 | 1 | < 0.1% |
| 40.91295931 | 2 | |
| 40.9127234 | 2 | |
| 40.912714 | 3 | |
| 40.91247643 | 2 | |
| 40.91246813 | 2 | |
| 40.912382 | 3 | |
| 40.911964 | 1 | < 0.1% |
| 40.91142553 | 1 | < 0.1% |
Longitude
Real number (ℝ)
High correlation Skewed
| Distinct | 135479 |
|---|---|
| Distinct (%) | 11.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.92374 |
| Minimum | -74.254377 |
|---|---|
| Maximum | 0 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 1232949 |
| Negative (%) | > 99.9% |
| Memory size | 9.4 MiB |
Quantile statistics
| Minimum | -74.254377 |
|---|---|
| 5-th percentile | -74.021211 |
| Q1 | -73.974565 |
| median | -73.925119 |
| Q3 | -73.8806 |
| 95-th percentile | -73.785624 |
| Maximum | 0 |
| Range | 74.254377 |
| Interquartile range (IQR) | 0.093964674 |
Descriptive statistics
| Standard deviation | 0.10180417 |
|---|---|
| Coefficient of variation (CV) | -0.0013771513 |
| Kurtosis | 225490.05 |
| Mean | -73.92374 |
| Median Absolute Deviation (MAD) | 0.046796527 |
| Skewness | 310.42973 |
| Sum | -91144276 |
| Variance | 0.01036409 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -73.88151172 | 5722 | 0.5% |
| -73.84725001 | 4573 | 0.4% |
| -73.91945797 | 4328 | 0.4% |
| -73.77590919 | 4207 | 0.3% |
| -73.87999831 | 3865 | 0.3% |
| -73.73476085 | 3825 | 0.3% |
| -73.90059136 | 3667 | 0.3% |
| -73.91536345 | 3641 | 0.3% |
| -73.92489531 | 3555 | 0.3% |
| -73.9508219 | 3421 | 0.3% |
| Other values (135469) | 1192146 |
| Value | Count | Frequency (%) |
| -74.254377 | 3 | |
| -74.25422295 | 2 | |
| -74.253256 | 3 | |
| -74.253187 | 1 | < 0.1% |
| -74.25285143 | 1 | < 0.1% |
| -74.252525 | 1 | < 0.1% |
| -74.25225064 | 4 | |
| -74.25208299 | 1 | < 0.1% |
| -74.25185131 | 1 | < 0.1% |
| -74.251844 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| -73.70029335 | 1 | < 0.1% |
| -73.70059685 | 1 | < 0.1% |
| -73.700717 | 2 | |
| -73.70072029 | 4 | |
| -73.7009566 | 2 | |
| -73.70098353 | 1 | < 0.1% |
| -73.70132362 | 1 | < 0.1% |
| -73.701605 | 1 | < 0.1% |
| -73.7016123 | 1 | < 0.1% |
Lon_Lat
Text
| Distinct | 143286 |
|---|---|
| Distinct (%) | 11.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 104.1 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 44 |
| Mean length | 39.571399 |
| Min length | 11 |
Unique
| Unique | 48718 ? |
|---|---|
| Unique (%) | 4.0% |
Sample
| 1st row | POINT (-73.985702 40.76539) |
|---|---|
| 2nd row | POINT (-73.95082 40.648859) |
| 3rd row | POINT (-73.9305713255961 40.6744956865259) |
| 4th row | POINT (-73.9005768807295 40.8535983673823) |
| 5th row | POINT (-73.901881 40.699373) |
| Value | Count | Frequency (%) |
| point | 1232950 | |
| 73.88151172399995 | 5722 | 0.2% |
| 40.67141166300007 | 5722 | 0.2% |
| 73.92489531099994 | 4908 | 0.1% |
| 40.810398494000026 | 4908 | 0.1% |
| 40.67998073800004 | 4617 | 0.1% |
| 40.88733281800006 | 4573 | 0.1% |
| 73.84725001299995 | 4573 | 0.1% |
| 73.91945797099999 | 4328 | 0.1% |
| 73.77590919399995 | 4207 | 0.1% |
| Other values (269696) | 2422342 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6406371 | |
| 9 | 5739971 | |
| 7 | 3890755 | 8.0% |
| 4 | 3570251 | 7.3% |
| 3 | 3195555 | 6.5% |
| 8 | 2749732 | 5.6% |
| 6 | 2469866 | 5.1% |
| 2465900 | 5.1% | |
| . | 2465898 | 5.1% |
| 5 | 2246118 | 4.6% |
| Other values (10) | 13589139 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 48789556 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 6406371 | |
| 9 | 5739971 | |
| 7 | 3890755 | 8.0% |
| 4 | 3570251 | 7.3% |
| 3 | 3195555 | 6.5% |
| 8 | 2749732 | 5.6% |
| 6 | 2469866 | 5.1% |
| 2465900 | 5.1% | |
| . | 2465898 | 5.1% |
| 5 | 2246118 | 4.6% |
| Other values (10) | 13589139 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 48789556 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 6406371 | |
| 9 | 5739971 | |
| 7 | 3890755 | 8.0% |
| 4 | 3570251 | 7.3% |
| 3 | 3195555 | 6.5% |
| 8 | 2749732 | 5.6% |
| 6 | 2469866 | 5.1% |
| 2465900 | 5.1% | |
| . | 2465898 | 5.1% |
| 5 | 2246118 | 4.6% |
| Other values (10) | 13589139 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 48789556 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 6406371 | |
| 9 | 5739971 | |
| 7 | 3890755 | 8.0% |
| 4 | 3570251 | 7.3% |
| 3 | 3195555 | 6.5% |
| 8 | 2749732 | 5.6% |
| 6 | 2469866 | 5.1% |
| 2465900 | 5.1% | |
| . | 2465898 | 5.1% |
| 5 | 2246118 | 4.6% |
| Other values (10) | 13589139 |
Interactions
Correlations
| AGE_GROUP | ARREST_BORO | ARREST_KEY | ARREST_PRECINCT | JURISDICTION_CODE | KY_CD | LAW_CAT_CD | Latitude | Longitude | PD_CD | PERP_RACE | PERP_SEX | X_COORD_CD | Y_COORD_CD | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AGE_GROUP | 1.000 | 0.028 | 0.030 | 0.031 | 0.017 | 0.052 | 0.047 | 0.000 | 0.000 | 0.081 | 0.059 | 0.022 | 0.003 | 0.000 |
| ARREST_BORO | 0.028 | 1.000 | 0.012 | 0.884 | 0.030 | 0.056 | 0.051 | 0.001 | 0.000 | 0.087 | 0.171 | 0.015 | 0.562 | 0.000 |
| ARREST_KEY | 0.030 | 0.012 | 1.000 | 0.013 | -0.048 | -0.067 | 0.047 | 0.003 | 0.011 | -0.080 | 0.019 | 0.072 | 0.011 | 0.003 |
| ARREST_PRECINCT | 0.031 | 0.884 | 0.013 | 1.000 | -0.085 | 0.007 | 0.047 | -0.480 | 0.368 | 0.046 | 0.148 | 0.014 | 0.369 | -0.479 |
| JURISDICTION_CODE | 0.017 | 0.030 | -0.048 | -0.085 | 1.000 | 0.075 | 0.020 | 0.024 | -0.031 | 0.056 | 0.026 | 0.010 | -0.031 | 0.023 |
| KY_CD | 0.052 | 0.056 | -0.067 | 0.007 | 0.075 | 1.000 | 0.726 | 0.015 | 0.004 | 0.161 | 0.035 | 0.055 | 0.004 | 0.015 |
| LAW_CAT_CD | 0.047 | 0.051 | 0.047 | 0.047 | 0.020 | 0.726 | 1.000 | 0.010 | 0.000 | 0.296 | 0.031 | 0.040 | 0.017 | 0.015 |
| Latitude | 0.000 | 0.001 | 0.003 | -0.480 | 0.024 | 0.015 | 0.010 | 1.000 | 0.290 | -0.054 | 0.008 | 0.000 | 0.289 | 1.000 |
| Longitude | 0.000 | 0.000 | 0.011 | 0.368 | -0.031 | 0.004 | 0.000 | 0.290 | 1.000 | -0.007 | 0.011 | 0.000 | 1.000 | 0.291 |
| PD_CD | 0.081 | 0.087 | -0.080 | 0.046 | 0.056 | 0.161 | 0.296 | -0.054 | -0.007 | 1.000 | 0.044 | 0.094 | -0.007 | -0.054 |
| PERP_RACE | 0.059 | 0.171 | 0.019 | 0.148 | 0.026 | 0.035 | 0.031 | 0.008 | 0.011 | 0.044 | 1.000 | 0.071 | 0.103 | 0.000 |
| PERP_SEX | 0.022 | 0.015 | 0.072 | 0.014 | 0.010 | 0.055 | 0.040 | 0.000 | 0.000 | 0.094 | 0.071 | 1.000 | 0.011 | 0.000 |
| X_COORD_CD | 0.003 | 0.562 | 0.011 | 0.369 | -0.031 | 0.004 | 0.017 | 0.289 | 1.000 | -0.007 | 0.103 | 0.011 | 1.000 | 0.290 |
| Y_COORD_CD | 0.000 | 0.000 | 0.003 | -0.479 | 0.023 | 0.015 | 0.015 | 1.000 | 0.291 | -0.054 | 0.000 | 0.000 | 0.290 | 1.000 |
Missing values
Sample
| ARREST_KEY | ARREST_DATE | PD_CD | PD_DESC | KY_CD | OFNS_DESC | LAW_CODE | LAW_CAT_CD | ARREST_BORO | ARREST_PRECINCT | JURISDICTION_CODE | AGE_GROUP | PERP_SEX | PERP_RACE | X_COORD_CD | Y_COORD_CD | Latitude | Longitude | Lon_Lat | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 279197226 | 12/19/2023 | 105.0 | STRANGULATION 1ST | 106.0 | FELONY ASSAULT | PL 1211200 | F | M | 18 | 0 | 25-44 | M | WHITE | 988210.0 | 218129.0 | 40.765390 | -73.985702 | POINT (-73.985702 40.76539) |
| 1 | 278761840 | 12/09/2023 | 105.0 | STRANGULATION 1ST | 106.0 | FELONY ASSAULT | PL 1211300 | F | K | 67 | 0 | 25-44 | M | BLACK | 997897.0 | 175676.0 | 40.648859 | -73.950820 | POINT (-73.95082 40.648859) |
| 2 | 278506761 | 12/05/2023 | 153.0 | RAPE 3 | 104.0 | RAPE | PL 1302503 | F | K | 77 | 0 | 25-44 | M | BLACK | 1003509.0 | 185018.0 | 40.674496 | -73.930571 | POINT (-73.9305713255961 40.6744956865259) |
| 3 | 278436408 | 12/03/2023 | 157.0 | RAPE 1 | 104.0 | RAPE | PL 1303501 | F | B | 46 | 0 | 45-64 | M | BLACK | 1011755.0 | 250279.0 | 40.853598 | -73.900577 | POINT (-73.9005768807295 40.8535983673823) |
| 4 | 278248753 | 11/29/2023 | 660.0 | (null) | NaN | (null) | PL 2407800 | M | Q | 104 | 0 | <18 | M | WHITE HISPANIC | 1011456.0 | 194092.0 | 40.699373 | -73.901881 | POINT (-73.901881 40.699373) |
| 5 | 278254593 | 11/29/2023 | 464.0 | JOSTLING | 230.0 | JOSTLING | PL 1652501 | M | M | 18 | 0 | <18 | M | WHITE HISPANIC | 990503.0 | 215519.0 | 40.758225 | -73.977428 | POINT (-73.977428 40.758225) |
| 6 | 277850807 | 11/21/2023 | 263.0 | ARSON 2,3,4 | 114.0 | ARSON | PL 1501001 | F | K | 63 | 71 | 25-44 | M | WHITE | 1000734.0 | 164367.0 | 40.617813 | -73.940621 | POINT (-73.940621 40.617813) |
| 7 | 276523582 | 10/26/2023 | 177.0 | SEXUAL ABUSE | 116.0 | SEX CRIMES | PL 2603204 | F | M | 28 | 0 | 25-44 | M | BLACK | 997407.0 | 233806.0 | 40.808418 | -73.952474 | POINT (-73.9524740603515 40.8084177460021) |
| 8 | 276466505 | 10/25/2023 | 157.0 | RAPE 1 | 104.0 | RAPE | PL 1303501 | F | K | 77 | 0 | 25-44 | M | BLACK | 1003509.0 | 185018.0 | 40.674496 | -73.930571 | POINT (-73.9305713255961 40.6744956865259) |
| 9 | 276391494 | 10/24/2023 | 168.0 | SODOMY 1 | 116.0 | SEX CRIMES | PL 1305004 | F | K | 77 | 0 | 45-64 | M | WHITE | 1003509.0 | 185018.0 | 40.674496 | -73.930571 | POINT (-73.9305713255961 40.6744956865259) |
| ARREST_KEY | ARREST_DATE | PD_CD | PD_DESC | KY_CD | OFNS_DESC | LAW_CODE | LAW_CAT_CD | ARREST_BORO | ARREST_PRECINCT | JURISDICTION_CODE | AGE_GROUP | PERP_SEX | PERP_RACE | X_COORD_CD | Y_COORD_CD | Latitude | Longitude | Lon_Lat | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1232940 | 169473028 | 09/18/2017 | 203.0 | TRESPASS 3, CRIMINAL | 352.0 | CRIMINAL TRESPASS | PL 1401000 | M | S | 123 | 0 | 65+ | M | WHITE | 931204.0 | 140539.0 | 40.552272 | -74.190887 | POINT (-74.19088650399993 40.55227230600008) |
| 1232941 | 170379964 | 10/13/2017 | 744.0 | BAIL JUMPING 3 | 359.0 | OFFENSES AGAINST PUBLIC ADMINISTRATION | PL 2155500 | M | Q | 102 | 0 | 25-44 | F | WHITE | 1032428.0 | 198872.0 | 40.712411 | -73.826217 | POINT (-73.82621729999995 40.71241149700006) |
| 1232942 | 170122094 | 10/06/2017 | 205.0 | TRESPASS 2, CRIMINAL | 352.0 | CRIMINAL TRESPASS | PL 1401500 | M | M | 10 | 2 | <18 | F | WHITE HISPANIC | 982756.0 | 210140.0 | 40.743470 | -74.005393 | POINT (-74.00539300399998 40.74347045500008) |
| 1232943 | 169630758 | 09/22/2017 | 567.0 | MARIJUANA, POSSESSION 4 & 5 | 235.0 | DANGEROUS DRUGS | PL 2211001 | M | B | 41 | 0 | 18-24 | M | BLACK HISPANIC | 1014604.0 | 238800.0 | 40.822083 | -73.890330 | POINT (-73.89033032599998 40.82208252700008) |
| 1232944 | 170079628 | 10/05/2017 | 567.0 | MARIJUANA, POSSESSION 4 & 5 | 235.0 | DANGEROUS DRUGS | PL 2211001 | M | B | 46 | 0 | 18-24 | M | BLACK | 1010882.0 | 247996.0 | 40.847335 | -73.903742 | POINT (-73.90374165199995 40.847334887000045) |
| 1232945 | 170410613 | 10/15/2017 | 113.0 | MENACING,UNCLASSIFIED | 344.0 | ASSAULT 3 & RELATED OFFENSES | PL 1201401 | M | B | 43 | 0 | 45-64 | M | BLACK | 1021019.0 | 238939.0 | 40.822440 | -73.867152 | POINT (-73.86715175299997 40.82243966900006) |
| 1232946 | 170287427 | 10/11/2017 | 109.0 | ASSAULT 2,1,UNCLASSIFIED | 106.0 | FELONY ASSAULT | PL 1200600 | F | B | 44 | 0 | <18 | F | BLACK | 1006537.0 | 244511.0 | 40.837782 | -73.919458 | POINT (-73.91945797099999 40.83778161800007) |
| 1232947 | 170148082 | 10/07/2017 | 269.0 | MISCHIEF,CRIMINAL, UNCL 2ND DEG 3RD DEG | 121.0 | CRIMINAL MISCHIEF & RELATED OFFENSES | PL 1450500 | F | M | 13 | 97 | 25-44 | M | BLACK | 986109.0 | 210622.0 | 40.744793 | -73.993293 | POINT (-73.99329254599996 40.74479335700005) |
| 1232948 | 170311307 | 10/12/2017 | 109.0 | ASSAULT 2,1,UNCLASSIFIED | 106.0 | FELONY ASSAULT | PL 1200512 | F | K | 67 | 0 | 25-44 | M | BLACK HISPANIC | 997897.0 | 175677.0 | 40.648867 | -73.950822 | POINT (-73.95082189999994 40.64886713300007) |
| 1232949 | 169598110 | 09/21/2017 | 729.0 | FORGERY,ETC.,UNCLASSIFIED-FELONY | 113.0 | FORGERY | PL 1704001 | F | B | 46 | 0 | 25-44 | M | WHITE HISPANIC | 1010820.0 | 250782.0 | 40.854982 | -73.903955 | POINT (-73.90395470599998 40.85498181500003) |